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SEQUENCE LISTING 



7 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 



Bujard, Hermann 
Gossen, Manfred 

(ii) TITLE OF INVENTION : Tetracycline-Inducible Transcriptional 

Inhibitor Fusion Proteins 

(iii) NUMBER OF SEQUENCES : 37 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 28 State Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/921,650 

(B) FILING DATE: 03-AUG-2001 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/489,777 

(B) FILING DATE: 24-JAN-2000 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 09/162,184 

(B) FILING DATE: 28-SEP-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/485,978 

(B) FILING DATE: 07-JUN-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/383,754 

(B) FILING DATE: 03-FEB-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/275,876 

(B) FILING DATE: 15-JULY-1994 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/270,637 

(B) FILING DATE: 01-JULY-1994 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/260,452 

(B) FILING DATE: 14-JUNE-1994 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/076,327 

(B) FILING DATE: 14-JUNE-1993 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/076,726 

(B) FILING DATE: 14-JUNE-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. Jr. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE /DOCKET NUMBER: BBI -00 9C6CNDVCN 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7 4 00 

(B) TELEFAX: (617)742-4214 

I) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 1 . . 1008 

(ix) FEATURE: 

(A) NAME /KEY: mRNA 

(B) LOCATION: 1..1008 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 1..207 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 208.. 335 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1005 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



48 



96 



144 



192 



240 



ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
1 5 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 
Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
bb 70 75 80 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 HO 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
i3 ° 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 4 80 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 isn ice 

13U 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 57 6 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC GCG 62 4 
He lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

TAC AGC CGC GCG CGT ACG AAA AAC AAT TAC GGG TCT ACC ATC GAG GGC 67 2 
Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr He Glu Gly 
210 215 220 



336 



384 



432 
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CTG CTC GAT 
Leu Leu Asp 
225 

GCT CCG CGC 
Ala Pro Arg 



ACG GCC CCC 
Thr Ala Pro 



GGC GAG GAC 
Gly Glu Asp 
275 

CTG GAC ATG 
Leu Asp Met 
290 

CAC GAC TCC 
His Asp Ser 
305 

GAG CAG ATG 
Glu Gin Met 



CTC CCG GAC 
Leu Pro Asp 
230 

CTG TCC TTT 
Leu Ser Phe 
245 

CCG ACC GAT 
Pro Thr Asp 
260 

GTG GCG ATG 
Val Ala Met 



TTG GGG GAC 
Leu Gly Asp 



GCC CCC TAC 
Ala Pro Tyr 
310 

TTT ACC GAT 
Phe Thr Asp 
325 



GAC GAC GCC 
Asp Asp Ala 



CTC CCC GCG 
Leu Pro Ala 



GTC AGC CTG 
Val Ser Leu 
265 

GCG CAT GCC 
Ala His Ala 
280 

GGG GAT TCC 

Gly Asp Ser 
295 

GGC GCT CTG 

Gly Ala Leu 



CCC CTT GGA 
Pro Leu Gly 



CCC GAA GAG 
Pro Glu Glu 
235 

GGA CAC ACG 
Gly His Thr 
250 

GGG GAC GAG 
Gly Asp Glu 



GAC GCG CTA 
Asp Ala Leu 



CCG GGT CCG 
Pro Gly Pro 
300 

GAT ATG GCC 
Asp Met Ala 
315 

ATT GAC GAG 
lie Asp Glu 
330 



GCG GGG CTG 
Ala Gly Leu 



CGC AGA CTG 
Arg Arg Leu 
255 

CTC CAC TTA 
Leu His Leu 
270 

GAC GAT TTC 
Asp Asp Phe 
285 

GGA TTT ACC 
Gly Phe Thr 



GAC TTC GAG 
Asp Phe Glu 



TAC GGT GGG 
Tyr Gly Gly 
335 



GCG 720 

Ala 

240 

TCG 768 
Ser 



GAC 816 
Asp 



GAT 8 64 
Asp 



CCC 912 
Pro 



TTT 960 

Phe 

320 

TAG 1008 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 . 60 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 



Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 



Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 
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Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr lie Glu Gly 
210 215 220 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 

Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 

Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 

Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
305 310 315 320 



Glu Gin Met Phe Thr Asp Pro Leu Gly lie Asp Glu Tyr Gly Gly 
325 330 335 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE 



DESCRIPTION: SEQ ID NO : 3 : 



GAC GCG CTA GAC GAT 
Asp Ala Leu Asp Asp 
1 5 



TTC GAT CTG GAC ATG TTG 
Phe Asp Leu Asp Met Leu 



33 



10 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Pro Lys Arg Pro Arg Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GAATTCGGGG CCGCGGAGGC TGGATCGGTC CCGGTGTCTT CTATGGAGGT CAAAACAGCG 60 
TGGATGGCGT CTCCAGGCGA TCTGACGGTT CACTAAACGA GCTCTGCTTA TATAGGTCGA 12 0 



1 



5 



10 
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GTTTACCACT 



CCCTATCAGT GATAGAGAAA AGTGAAAGTC GAGTTTACCA CTCCCTATCA 



180 



GTGATAGAGA 



AAAGTGAAAG TCGAGTTTAC CACTCCCTAT CAGTGATAGA GAAAAGTGAA 



240 



AGTCGAGTTT 



ACCACTCCCT ACCAGTGATA GAGAAAAGTG AAAGTCGAGT TTACCACTCC 



300 



CTATCAGTGA 



TAGAGAAAAG TGAAAGTCGA GTTTACCACT CCCTATCAGT GATAGAGAAA 



360 



AGTGAAAGTC GAGTTTACCA CTCCCTATCA GTGATAGAGA AAAGTGAAAG TCGAGCTCGG 4 20 

TACCCGGGTC GAGTAGGCGT GTACGGTGGG AGGCCTATAT AAGCAGAGCT CGTTTAGTGA 4 80 

ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA AGACACCGGG 54 0 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AGATCTGCAG GGTCGCTCGG TGTTCGAGGC CACACGCGTC ACCTTAATAT GCGAAGTGGA 60 

CCGGATCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 120 

ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 180 

AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 24 0 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 300 

TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA 3 60 

GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 4 20 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 4 80 

AAGACACCGG GACCGATCCA GCCTCCGCGG CCCCGAATTC 52 0 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



ACCGATCCAG 



CCTCCGCGGC CCCGAATTC 



569 



(ii) MOLECULE TYPE: DNA (genomic) 



• 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: K12, Towne 

(ix) FEATURE: 

(A) NAME /KEY: mRNA 

(B) LOCATION: 382.. 450 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GAATTCCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 60 

ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 120 

AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 180 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 24 0 

TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA 30 0 

GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 360 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 420 

AAGACACCGG GACCGATCCA GCCTCCGCGG 4 50 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: Towne 

(ix) FEATURE: 

(A) NAME /KEY: mRNA 

(B) LOCATION: 382.. 450 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GAATTCCTCG ACCCGGGTAC CGAGCTCGAC TTTCACTTTT CTCTATCACT GATAGGGAGT 60 

GGTAAACTCG ACTTTCACTT TTCTCTATCA CTGATAGGGA GTGGTAAACT CGACTTTCAC 120 

TTTTCTCTAT CACTGATAGG GAGTGGTAAA CTCGACTTTC ACTTTTCTCT ATCACTGATA 180 

GGGAGTGGTA AACTCGACTT TCACTTTTCT CTATCACTGA TAGGGAGTGG TAAACTCGAC 24 0 



TTTCACTTTT CTCTATCACT GATAGGGAGT GGTAAACTCG ACTTTCACTT TTCTCTATCA 30 0 
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CTGATAGGGA GTGGTAAACT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 3 60 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 4 20 

AAGACACCGG GACCGATCCA GCCTCCGCGG 



(2) INFORMATION FOR SEQ ID NO : 1 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Herpes Simplex Virus 

(B) STRAIN: KOS 



(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



450 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


10 : 






GAGCTCGACT 


TTCACTTTTC 


TCTATCACTG 


ATAGGGAGTG 


GTAAACTCGA 


CTTTCACTTT 


60 


TCTCTATCAC 


TGATAGGGAG 


TGGTAAACTC 


GACTTTCACT 


TTTCTCTATC 


ACTGATAGGG 


120 


AGTGGTAAAC 


TCGACTTTCA 


CTTTTCTCTA 


TCACTGATAG 


GGAGTGGTAA 


ACTCGACTTT 


180 


CACTTTTCTC 


TATCACTGAT 


AGGGAGTGGT 


AAACTCGACT 


TTCACTTTTC 


TCTATCACTG 


240 


ATAGGGAGTG 


GTAAACTCGA 


CTTTCACTTT 


TCTCTATCAC 


TGATAGGGAG 


TGGTAAACTC 


300 


GAGATCCGGC 


GAATTCGAAC 


ACGCAGATGC 


AGTCGGGGCG 


GCGCGGTCCG 


AGGTCCACTT 


360 


CGCATATTAA 


GGTGACGCGT 


GTGGCCTCGA 


ACACCGAG 






398 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 
ACTTTATCAC TGATAAACAA ACTTATCAGT GATAAAGA 



38 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ACTCTATCAT TGATAGAGTT CCCTATCAGT GATAGAGA 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGCTTATCAT CGATAAGCTA GTTTATCACA GTTAAATT 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ACTCTATCAT TGATAGGGAA CTCTATCAAT GATAGGGA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: 



DNA 



10 



15 



20 



35 
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55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AATCTATCAC TGATAGAGTA CCCTATCATC GATAGAGA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 6 : 



96 



144 



192 



240 



ATG TCT AGA TTA GAT AAA ACT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 

5 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
^ Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu £a Gin 

0 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Tr P His Val L^ Tsl L^s 

30 45 

Arc SI T TTG t CTC f C ^ " A GCC ^ GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 

5U 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 
Thr Hrs Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 

70 75 80 

40 A^n f G S T AGA TGT GCT TTA CTA AGT CAT GAT GGA 288 

40 Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 

85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 
Ala Lys Val Hxs Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
50 115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 Y 

vll ITu Glf f T ^ GTC GCT ^ GAA GAA AGG GAA ACA 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

±43 150 



336 



384 



432 



480 



155 



160 
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CCT ACT ACT GAT ACT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 57 6 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
!80 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 
lie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 17: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 
1 5 10 15 

Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 

65 7f) -ir- ^ 

/u 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
1Q 0 105 HO 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 i 6 o 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 
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Ile He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 4 8 

Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 
15 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 9 6 

Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 14 4 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 24 0 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 28 8 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 33 6 

Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 38 4 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 4 32 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 



GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 



480 
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CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 52 8 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 57 6 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150' 155 160 



Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 
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Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GAC ATG GAA AAA GCG ACA CCG GAG ACG ATG GTC CAT TGG ATT TGT CTG 4 8 

Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp lie Cys Leu 

15 10 15 

AAG ATG GAG CCA GCT CTG TGG ATG GCC ATT ACA GCA ACA TCG CAC GGC 9 6 

Lys Met Glu Pro Ala Leu Trp Met Ala lie Thr Ala Thr Ser His Gly 

20 25 30 

GCA AGG CAC AGG ACA TTC GTC GGG TTT TCC GGC TGC CTC CAC CGC AAA 14 4 

Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 

35 40 45 

TCC CTC ACG TAC CCA GTG ATA TGC CTG AGC AAA CCG AGC CAG AGG ATT 192 

Ser Leu Thr Tyr Pro Val lie Cys Leu Ser Lys Pro Ser Gin Arg lie 

50 55 60 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp lie Cys Leu 
15 10 15 

Lys Met Glu Pro Ala Leu Trp Met Ala lie Thr Ala Thr Ser His Gly 
20 25 30 



Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 
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Ser Leu Thr Tyr Pro Val He Cys Leu Ser Lys Pro Ser Gin Arg He 

55 60 



50 



(2) I N FORMAT I ON FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CTG GAC GAC TCG AAG CGC GTA GCC AAG CGG AAG CTG ATC GAG GAG AAC 
Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu He Glu Glu Asn 
1 5 10 

CGG GAG CGG CGA CGC AAG GAG GAG ATG ATC AAA TCC CTG CAG CAC CGG 
Arg Glu Arg Arg Arg Lys Glu Glu Met He Lys Ser Leu Gin His Arg 
20 25 

CCC AGC CCC AGC GCA GAG GAG TGG GAG CTG ATC CAC GTG GTG ACC GAG 
Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu He His Val Val Thr Glu 
35 40 45 

GCG CAC CGC AGC ACC AAC GCG CAG GGC AGC CAC TGG AAG CAG AGG AGG 
tit Ss Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 
50 55 60 

AAA TTC CTG CTC GAA GAT ATC GGT CAG TCG CCC ATG GCC TCC ATG CTT 
J£ Phe Leu Leu Glu Asp He Gly Gin Ser Pro Met Ala Ser Met Leu 
65 ™ 75 

GAC GGG GAC AAA GTG GAC CTG GAG GCG TTC AGC GAG TTT ACA AAA ATC 
Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He 
85 90 

ATC ACG CCG GCC ATC ACC CGC GTG GTC GAC TTT GCC AAA AAC CTG CCC 
Vhr Pro Ala He Thr Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
100 105 

ATG TTC TCG GAG CTG CCG TGC GAG GAT CAG ATC ATC CTG CTG AAG GGC 
Zt Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly 
115 120 125 

TCC TGC ATG GAG ATC ATG TCG CTG CGC GCC GCC GTG CGC TAC GAC CCC 
Cys Cys Set Glu He Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro 
130 135 I 40 

GAG AGC GAA ACG CTG ACG CTG AGC GGG GAA ATG GCC GTC AAA CGC GAG 
Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 150 I 55 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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CAG TTG AAG 
Gin Leu Lys 



CTC GGC AAG 
Leu Gly Lys 



CTG CTG CAG 
Leu Leu Gin 
195 

TGC GTG GAT 
Cys Val Asp 
210 

GAG CAC TAC 

Glu His Tyr 
225 

AAG CTG CTG 

Lys Leu Leu 



GCC AGC CGC 
Ala Ser Arg 



AAC GGA GGG 
Asn Gly Gly 
165 

TCG CTG TCT 
Ser Leu Ser 
180 

GCC GTG CTG 
Ala Val Leu 



AAG ATA GAG 
Lys lie Glu 



ATC AAC TAC 
lie Asn Tyr 
230 

ATG AAG GTG 
Met Lys Val 
245 

TTC CTG CAC 
Phe Leu His 
260 



CTG GGG GTC 
Leu Gly Val 



GCC TTC AAC 
Ala Phe Asn 
185 

CTC ATG TCC 
Leu Met Ser 
200 

AAG TGC CAG 
Lys Cys Gin 
215 

CGC AAA CAC 
Arg Lys His 



GCG GAC CTG 
Ala Asp Leu 



ATG AAG GTG 
Met Lys Val 
265 



-91 - 

GTG TCT GAT 
Val Ser Asp 
170 

CTG GAC GAC 
Leu Asp Asp 



TCA GAC CGG 
Ser Asp Arg 



GAG TCG TAC 
Glu Ser Tyr 
220 

AAC ATT CCC 
Asn lie Pro 
235 

CGC ATG ATC 
Arg Met lie 
250 

GAG TGC CCC 
Glu Cys Pro 



GCC ATC TTC 
Ala lie Phe 
175 

ACC GAG GTG 
Thr Glu Val 
190 

ACG GGG CTG 
Thr Gly Leu 
205 

CTG CTG GCG 
Leu Leu Ala 



CAC TTC TGG 
His Phe Trp 



GGC GCC TAC 
Gly Ala Tyr 
255 

ACC GAG CTC 
Thr Glu Leu 
270 



GAC 52 8 

Asp 



GCC 57 6 

Ala 



ATC 62 4 

He 



TTC 672 
Phe 



TCC 720 

Ser 

240 

CAC 7 68 

His 



TCC 816 
Ser 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu He Glu Glu Asn 
15 10 15 

Arg Glu Arg Arg Arg Lys Glu Glu Met He Lys Ser Leu Gin His Arg 
20 25 30 

Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu He His Val Val Thr Glu 
35 40 45 

Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 
50 55 60 

Lys Phe Leu Leu Glu Asp He Gly Gin Ser Pro Met Ala Ser Met Leu 
65 70 75 80 



Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He 
85 90 95 
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Ile Thr Pro Ala Ile Thr Arg Val Ma l ^ ^ n l ^ p ^ 

105 110 
»t S., Slu Leu Pro cya Rsp ue ie(] Lm 

125 

Cys Cys Met Glu Ile Met 5 Le u Ar g Ala Ala Val Arg Tyr Agp ^ 

US 14Q 

Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Ar g Glu 

150 155 



160 

Gin Leu Lys Asn Gly Gly Leu Gly Val Val Ser Asp Ala lie Phe Asp 
15 170 175 

Leu Gly Lys Se Leu Ser Alg phe ^ ^ ^ ^ ^ ^ ^ 

185 19Q 

Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp A rg Thr Gly Leu Ile 

zuu 205 

Cys Val Asp Lys ue Glu L ys Cys Gin Glu Ser Tyr Leu Leu Ala Phe 

zis 220 

Glu His Tyr Il e Asn Tyr Arg Lys Hls Asn He Pro His Phe Trp Ser 

235 



20 



25 



240 

^ 2 L r 5 S Ala AS P Leu ^ Met lie Gly Ala Tyr His 

30 ^ 250 255 

Ala Ser Ar g Phe Leu His Met Lys Val Glu Cys Pro Thr Glu Leu Ser 



265 270 



35 (2) INFORMATION FOR SEQ ID NO: 24 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 25 base pairs 
(Bj TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCCCCGGGTA ACTAAGTAAG GATCC 

(2) INFORMATION FOR SEQ ID NO: 25: 



25 



(i) SEQUENCE CHARACTERISTICS: 
cc < A > LENGTH : 24 base pairs 

< B > TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
AGTGGGTCCC CGGGTGACAT GGAA 2 4 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ser Gly Ser Pro Gly Asp Met Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGTGGGTCCC CGGGTCTGGA CGAC 2 4 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Gly Ser Pro Gly Leu Asp Asp 
1 5 
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(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 

15 



1 5 10 



Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 

70 75 80 

Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 10 5 



110 



Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

150 155 16Q 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 30: 
5 Met Ala Ar g Leu Asn Arg Glu Ser Val ^ ^ ^ ^ ^ ^ ^ 

Leu As n Glu Thr Gly Ile Asp Glu Thr ^ ^ ^ ^ ^ ^ 



25 30 



10 Lys Leu Gly l le Glu Gln Pro Th Leu Tyr ^ ^ ^ ^ ^ 

qU 45 

Arg Ala Leu Leu Asp Ala Leu Ala Val Clu Il e Leu Ala Arg His His 
15 " 60 

Asp Tyr ser Leu Pro „. „. sly „„ Sec ^ ^ ^ ^ ^ 

2Q As„ As„ Ale Met s „ phe flrg ftrg AU t ^ ^ ^ Gi 8 " 

90 95 
«. v.! HI. Leu Gly Thr Srg flsp ^ ^ ^ ^ ^ 

25 ».! Glu ™ r eln Leu fltg Phe Thr elu 

i^u 125 

Gly Leu Tyr Ala lie Ser Ala Val Ser His Phe Thr Leu Gly Ala 

30 135 140 

»J1 Leu Glu 01n Gln Hi= Thr flla ^ flsp ^ ^ ^ 

w Ala Pro Asp Giu ? Leu pto l ^ flrg flia 60 

17 0 

"et Asp ser Asp Asp Gly Glu Gln „. p „. Leu ^ My ^ ^ ^ 

40 Leu u. Gly Phe G1 „ val mr Ma l ^ l ^ ^ ^ 

/uu 205 
Gly Gly Asp Lys Leu He Ile Pro Phe Cys 
Zl ° 215 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Thr Lys Leu Gln Pro Asn Thr Val He Arg Ala Ala Leu 



10 15 



Asp Leu 
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L eu Asn Glu Val Gly Val Asp Gly Leu Thr Thr Arg Lys Leu Ala Glu 

20 " 

5 Arg le U Gly V,! on Gin Pro Ala Le U Tyr »rp HI. Ph« «n Lys 

35 4U 
Ar g Ala leu Leu Asp Ala Leu Ala Glu Ala Met Leu Ala Glu Asn His 
50 55 

10 



hr His Ser Val Pro Arg Ala As P Asp Asp Trp Arg Ser Phe Leu lie 
65 70 
Gly Asn Ala Arg Ser Phe Arg Gin Ala Leu Leu Ala Tyr Arg Asp Gly 

15 85 

lie His Ala Gly Thr Arg Pro Gly Ala Pro Gin Met Glu Thr 



Ala Arg — - 

100 



20 Ala Asp Ala Gin Leu Arg Phe Leu Cys Glu Ala Gly P 



ly Phe Ser Ala Gly 



115 



25 



Asp Ala val Asn Ala Leu Met Thr He Ser Tyr Phe Thr Val Gly Ala 
130 135 
25 val Leu Glu Glu Gin Ala Gly Asp Ser Glu Ser Gly Glu Arg Gly Gly 
45 150 

Val Glu Gin Ala Pro Leu Ser Pro Leu Leu Arg Ala Ala lie Asp 



Thr 



o 165 

Ma Phe Asp Glu Ala Gly Pro Asp Ala Ala Phe Glu Gin Gly Leu Ala 

180 1Hb 

35 Val He Val «P Gly le» Ala Lys «„ «, leu Val V.l «, «n Val 

195 ^ uu 



Glu Gly Pro Arg Lys Gly Asp Asp 
210 215 

40 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
Met Asn Lys Leu Gin Arg Glu Ala Val lie Arg Thr Ala Leu Gly Leu 

1 5 
L eu Asn Asp Val Gly Met Glu Gly Leu Thr Thr Arg Arg Leu Ala Glu 



20 25 



Attorney Docket No. BW009C6CNDVCN 

-97- 



Arg Leu Gly Val 
35 

Arg Ala Leu Leu 
50 

Thr His Ser Thr 
65 

Gly Asn Ala Cys 



Ala Arg lie His 
100 

Ala Asp Ala Gin 
115 

Asp Ala Thr Tyr 
130 

Val Leu Glu Gin 
145 

Asp Gin Leu Thr 



Ala Met Lys lie 
180 

Gly Leu Ala Leu 
195 

Asn Asp lie Glu 
210 



Gin Gin Pro Ala 
40 

Asp Ala Leu Ala 
55 

Pro Arg Asp Asp 
70 

Ser Phe Arg Arg 
85 

Ala Gly Thr Arg 



Leu Arg Phe Leu 
120 

Ala Leu Met Ala 
135 

Gin Ala Ser Glu 
150 

Thr Ser Ala Ser 
165 

Val Tyr Glu Ala 



He He Gly Gly 
200 

Val Leu Lys Asn 
215 



Leu Tyr Trp His 



Glu Ala Met Leu 
60 

Asp Asp Trp Arg 
75 

Ala Leu Leu Ala 
90 

Pro Ala Ala Pro 
105 

Cys Asp Ala Gly 



He Ser Tyr Phe 
140 

Ala Asp Ala Glu 
155 

Thr Met Pro Ala 
170 

Gly Pro Asp Ala 
185 

Leu Glu Lys Met 



Val Asp Glu 



Phe Lys Asn Lys 
45 

Thr He Asn His 



Ser Phe Leu Lys 
80 

Tyr Arg Asp Gly 
95 

Gin Met Glu Lys 
110 

Phe Ser Ala Gly 
125 

Thr Val Gly Ala 



Glu Arg Gly Glu 
160 

Arg Leu Gin Ser 
175 

Ala Phe Glu Arg 
190 

Arg Leu Thr Thr 
205 



(2) INFORMATION FOR SEQ ID NO:33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Met Thr Lys Leu 
1 

Leu Asn Glu Val 
20 

Arg Leu Lys Val 
35 

Arg Ala Leu Leu 
50 



Asp Lys Gly Thr 

5 

Gly Met Asp Ser 

Gin Gin Pro Ala 
40 

Asp Ala Leu Pro 
55 



Val He Ala Ala 
10 

Leu Thr Thr Arg 
25 

Leu Tyr Trp His 



Glu Ala Met Leu 
60 



Gly Leu Glu Leu 
15 

Lys Leu Ala Glu 
30 

Phe Gin Asn Lys 
45 

Arg Glu Arg His 
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Thr Arg Ser 
65 

Glu Asn Ala 



Ala Arg lie 



Leu Pro 



Leu Ser 
85 

His Ala 
100 



Glu Glu Asn 
70 

Phe Arg Thr 
Gly Thr Arg 



Glu Asp Trp 

75 

Ala Leu Leu 
90 

Pro Thr Glu 
105 



Arg Val 
Ser Tyr 
Pro Asn 



Phe Leu Lys 
80 

Arg Asp Gly 
95 

Phe Gly Thr 
110 



Ala Glu Thr 
115 

Arg Ala Val 
130 



Gin lie 



Trp Ala 



Arg Phe Leu 
120 

Leu Arg Ala 
135 



Cys Ala Glu 



Val Ser His 



Gly Phe 
125 

Tyr Val 
140 



Cys Pro Lys 
Val Gly Ser 



Val Leu Glu 
145 



Gin Gin 



Ala Ser Asp 
150 



Ala Asp Glu 
155 



Arg Val 



Pro Asp Arg 
160 



Pro Asp Val 



Ser Glu 
165 



Gin Ala Pro 



Ser Ser Phe 
170 



Leu His 



Val Leu Phe 
175 



His Glu Leu 



Glu Thr 
180 



Asp Gly Met 



Asp Ala Ala 
185 



Phe Asn 



Phe Gly Leu 
190 



Asp Ser Leu 
195 



lie Ala 



Gly Phe Glu 
200 



Arg Leu Arg 



Ala Ala 

205 



Val Leu Ala 



Thr Asp 
210 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Ala Arg Leu Ser Leu Asp Asp Val lie Ser Met Ala Leu Thr Leu 
15 10 15 

Leu Asp Ser Glu Gly Leu Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Ser Leu Lys lie Glu Gin Pro Thr Leu Tyr Trp His Val Arg Asn Lys 
35 40 45 

Gin Thr Leu Met Asn Met Leu Ser Glu Ala lie Leu Ala Lys His His 
50 55 60 



Thr Arg Ser Ala Pro Leu Pro Thr Glu Ser Trp Gin Gin Phe Leu Gin 
65 70 75 80 



E^C 
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Glu Asn Ala Leu Ser Phe Arg Lys Ala Leu Leu Val His Arg Asp Gly 
85 90 95 

Ala Arg Leu His lie Gly Thr Ser Pro Thr Pro Pro Gin Phe Glu Gin 
5 100 105 110 

Ala Glu Ala Gin Leu Arg Cys Leu Cys Asp Ala Gly Phe Ser Val Glu 
115 120 125 

10 Glu Ala Leu Phe lie Leu Gin Ser lie Ser His Phe Thr Leu Gly Ala 
130 135 140 



15 



30 



45 



50 



55 



Val Leu Glu Glu Gin Ala Thr Asn Gin lie Glu Asn Asn His Val lie 

1^5 150 155 160 

Asp Ala Ala Pro Pro Leu Leu Gin Glu Ala Phe Asn lie Gin Ala Arg 

165 170 175 



Thr Ser Ala Glu Met Ala Phe His Phe Gly Leu Lys Ser Leu lie Phe 
20 180 185 190 

Gly Phe Ser Ala Gin Leu Asp Glu Lys Lys His Thr Pro lie Glu Asp 
195 200 205 

25 Gly Asn Lys 
210 



(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

agttctatgt ctagattaga taaaagtaaa gtgattaaca gcgcattaga gctgcttaat 60 

gaggtcggaa tcgaaggttt aacaacccgt aaactcgccc agaagctagg tgtagagcag 120 

cctacattgt attggcatgt aaaaaataag cgggctttgc tcgacgcctt agccattgag 180 

atgttagata ggcaccatac tcacttttgc cctttagaag gggaaagctg gcaagatttt 240 

ttacgtaata aggctaaaag ttttagatgt gctttactaa gtcatcgcga tggagcaaaa 300 

gtacatttag gtacacggcc tacagaaaaa cagtatgaaa ctctcgaaaa tcaattagcc 360 

tttttatgcc aacaaggttt ttcactagag aatgcattat atgcactcag cgctgtgggg 420 

cattttactt taggttgcgt attggaagat caagagcatc aagtcgctaa agaagaaagg 480 

gaaacaccta ctactgatag tatgccgcca ttattacgac aagctatcga attatttgat 540 
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caccaaggtg cagagccagc cttcttattc ggccttgaat tgatcatatg cggattagaa 600 
aaacaactta aatgtgaaag tgggtccgac atggaaaaag cgacaccgga gacgatggtc 660 
cattggattt gtctgaagat ggagccagct ctgtggatgg ccattacagc aacatcgcac 720 
ggcgcaaggc acaggacatt cgtcgggttt tccggctgcc tccaccgcaa atccctcacg 780 

819 

tacccagtga tatgcctgag caaaccgagc cagaggatt 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
atgtctatgt ctagattaga taaaagtaaa gtgattaaca gcgcattaga gctgcttaat 60 
gaggtcggaa tcgaaggttt aacaacccgt aaactcgccc agaagctagg tgtagagcag 120 
cctacattgt attggcatgt aaaaaataag cgggctttgc tcgacgcctt agccattgag 180 
atgttagata ggcaccatac tcacttttgc cctttagaag gggaaagctg gcaagatttt 240 
ttacgtaata aggctaaaag ttttagatgt gctttactaa gtcatcgcga tggagcaaaa 300 
gtacatttag gtacacggcc tacagaaaaa cagtatgaaa ctctcgaaaa tcaattagcc 360 
tttttatgcc aacaaggttt ttcactagag aatgcattat atgcactcag cgctgtgggg 420 
cattttactt taggttgcgt attggaagat caagagcatc aagtcgctaa agaagaaagg 480 
gaaacaccta ctactgatag tatgccgcca ttattacgac aagctatcga attatttgat 540 
caccaaggtg cagagccagc cttcttattc ggccttgaat tgatcatatg cggattagaa 600 
aaacaactta aatgtgaaag tgggtcctcc ccgggtaact aagtaaggat cc 652 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
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atgtctatgt ctagattaga taaaagtaaa gtgattaaca gcgcattaga gctgcttaat 60 
gaggtcggaa tcgaaggttt aacaacccgt aaactcgccc agaagctagg tgtagagcag 120 
cctacattgt attggcatgt aaaaaataag cgggctttgc tcgacgcctt agccattgag 180 
atgttagata ggcaccatac tcacttttgc cctttagaag gggaaagctg gcaagatttt 240 
ttacgtaata aggctaaaag ttttagatgt gctttactaa gtcatcgcga tggagcaaaa 300 
gtacatttag gtacacggcc tacagaaaaa cagtatgaaa ctctcgaaaa tcaattagcc 360 
tttttatgcc aacaaggttt ttcactagag aatgcattat atgcactcag cgctgtgggg 420 
cattttactt taggttgcgt attggaagat caagagcatc aagtcgctaa agaagaaagg 480 
gaaacaccta ctactgatag tatgccgcca ttattacgac aagctatcga attatttgat 540 
caccaaggtg cagagccagc cttcttattc ggccttgaat tgatcatatg cggattagaa 600 
aaacaactta aatgtgaaag tgggtccctg gacgactcga agcgcgtagc caagcggaag 660 
ctgatcgagg agaaccggga gcggcgacgc aaggaggaga tgatcaaatc cctgcagcac 720 
cggcccagcc ccagcgcaga ggagtgggag ctgatccacg tggtgaccga ggcgcaccgc 780 
agcaccaacg cgcagggcag ccactggaag cagaggagga aattcctgct cgaagatatc 840 
ggtcagtcgc ccatggcctc catgcttgac ggggacaaag tggacctgga ggcgttcagc 900 
gagtttacaa aaatcatcac gccggccatc acccgcgtgg tcgactttgc caaaaacctg 960 
cccatgttct cggagctgcc gtgcgaggat cagatcatcc tgctgaaggg ctgctgcatg 1020 
gagatcatgt cgctgcgcgc cgccgtgcgc tacgaccccg agagcgaaac gctgacgctg 1080 
agcggggaaa tggccgtcaa acgcgagcag ttgaagaacg gagggctggg ggtcgtgtct 1140 
gatgccatct tcgacctcgg caagtcgctg tctgccttca acctggacga caccgaggtg 1200 
gccctgctgc aggccgtgct gctcatgtcc tcagaccgga cggggctgat ctgcgtggat 1260 
aagatagaga agtgccagga gtcgtacctg ctggcgttcg agcactacat caactaccgc 1320 
aaacacaaca ttccccactt ctggtccaag ctgctgatga aggtggcgga cctgcgcatg 1380 
atcggcgcct accacgccag ccgcttcctg cacatgaagg tggagtgccc caccgagctc 1440 
50 tec 1443 



10 
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